SDSS Log Viewer : visual exploratory analysis of large-volume SQL log data

نویسندگان

  • Jian Zhang
  • Chaomei Chen
  • Michael S. E. Vogeley
  • Danny Pan
  • Ani Thakar
  • Jordan Raddick
چکیده

User-generated Structured Query Language (SQL) queries are a rich source of information for database analysts, information scientists, and the end users of databases. In this study a group of scientists in astronomy and computer and information scientists work together to analyze a large volume of SQL log data generated by users of the Sloan Digital Sky Survey (SDSS) data archive in order to better understand users’ data seeking behavior. While statistical analysis of such logs is useful at aggregated levels, efficiently exploring specific patterns of queries is often a challenging task due to the typically large volume of the data, multivariate features, and data requirements specified in SQL queries. To enable and facilitate effective and efficient exploration of the SDSS log data, we designed an interactive visualization tool, called the SDSS Log Viewer, which integrates time series visualization, text visualization, and dynamic query techniques. We describe two analysis scenarios of visual exploration of SDSS log data, including understanding unusually high daily query traffic and modeling the types of data seeking behaviors of massive query generators. The two scenarios demonstrate that the SDSS Log Viewer provides a novel and potentially valuable approach to support these targeted tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effectiveness of Cognitive Captain's Log Software on Visual-Spatial Perception of Student with Learning Disabilities

Purpose: The purpose of this study was the Effectiveness cognitive Captain's Log software on visual-spatial perception for student with learning disability. Method: This research was a  pretest-posttest design with control group. The statistical population consisted of all students with learning disabilities who were referred to educational and rehabilitation centers of students with specific l...

متن کامل

Visual Tracking using Kernel Projected Measurement and Log-Polar Transformation

Visual Servoing is generally contained of control and feature tracking. Study of previous methods shows that no attempt has been made to optimize these two parts together. In kernel based visual servoing method, the main objective is to combine and optimize these two parts together and to make an entire control loop. This main target is accomplished by using Lyapanov theory. A Lyapanov candidat...

متن کامل

Estimation of Flow Zone Indicator Distribution by Using Seismic Data: A Case Study from a Central Iranian Oilfield

Flow unit characterization plays an important role in heterogeneity analysis and reservoir simulation studies. Usually, a correct description of the lateral variations of reservoir is associated with uncertainties. From this point of view, the well data alone does not cover reservoir properties. Because of large well distances, it is difficult to build the model of a heterogenic reservoir, but ...

متن کامل

Splash: Integrated Ad-Hoc Querying of Data and Statistical Models

This paper presents a system called Splash, which integrates statistical modeling and SQL for the purpose of adhoc querying and analysis. Splash supports a novel, simple, and practical abstraction of statistical modeling as an aggregate function, which in turn provides for natural integration with standard SQL queries and a relational DBMS. In addition, we introduce and implement a novel repres...

متن کامل

Analyzing Engagement in a Web-Based Intervention Platform Through Visualizing Log-Data

BACKGROUND Engagement has emerged as a significant cross-cutting concern within the development of Web-based interventions. There have been calls to institute a more rigorous approach to the design of Web-based interventions, to increase both the quantity and quality of engagement. One approach would be to use log-data to better understand the process of engagement and patterns of use. However,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012